The 2004 BBN 1xRT recognition systems for English broadcast news and conversational telephone speech
نویسندگان
چکیده
This paper describes the BBN real-time recognition systems used in the 2004 Rich Transcription (RT) benchmark test for the English Conversational Telephone Speech (CTS) and Broadcast News (BN) tasks. We describe the system architecture, along with the algorithms we used in order to reduce computation with minimal impact on recognition accuracy. Particular choices in the design of the final system are analyzed to show the trade-offs between speed and accuracy. We also present recently developed new architecture for the real-time systems, which outperforms the systems we submitted for the RT04 benchmark tests for both domains.
منابع مشابه
The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system
In this paper we describe the English Conversational Telephone Speech (CTS) recognition system jointly developed by BBN and LIMSI under the DARPA EARS program for the 2004 evaluation conducted by NIST. The 2004 BBN/LIMSI system achieved a word error rate (WER) of 13.5% at 18.3xRT (realtime as measured on Pentium 4 Xeon 3.4 GHz Processor) on the EARS progress test set. This translates into a 22....
متن کاملJapanese broadcast news transcription
In this paper, we describe the on-going development of a Japanese Broadcast News Transcription system at BBN Technologies. This is a collaboration between BBN and NHK to use automatic speech recognition technology to provide live closed caption for NHK’s TV news programs in Japan. We describe what the NHK Broadcast News Corpus comprises and how we adopted transcription technology developed for ...
متن کاملImprovements to the BBN RT04 Mandarin conversational telephone speech recognition system
BBN’s 20 times real-time (20xRT) Mandarin conversational telephone speech (CTS) recognition system achieved the lowest character error rate (CER) in the Rich Transcription 2004 fall (RT04F) evaluation conducted by NIST. This paper focuses on the work we have done after the evaluation. The work includes porting of more new acoustic modeling technologies we had developed on English, such as long-...
متن کاملImproving Automatic Sentence Boundary Detection with Confusion Networks
We extend existing methods for automatic sentence boundary detection by leveraging multiple recognizer hypotheses in order to provide robustness to speech recognition errors. For each hypothesized word sequence, an HMM is used to estimate the posterior probability of a sentence boundary at each word boundary. The hypotheses are combined using confusion networks to determine the overall most lik...
متن کاملAutomatic Classification and Transcription of Telephone Speech in Radio Broadcast Data
Automatic transcription of telephone speech involves additional challenges compared to wideband data processing, mainly due to channel limitations and to particular characteristics of conversational telephone speech. While in TV speech recognition applications, such as automatic transcription of broadcast news, the presence of telephone data is nearly insignificant (less than 1 %), in most radi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005